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(57) The present invention provides a recording 

medium and a recording/playback unit for use with the 
recording medium. The recording medium comprises a 
still image data area (102) capable of storing a 
plurality of still image data (VOB) pieces therein and 
an area (102) storing still image set management 
information (VOBSI) therein for managing a part or the 
whole of the still image data (VOB) in the still image 
data area as one still image set (VOBS). Each still 
image set (VOBS) has the corresponding still image set 
management information (VOBSI). 
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[0001] The present invention relates to a recording medium to and from which digital data may be written and 
read, to a recording unit recording digital data on it, and to a playback unit playing back digital data from it. 
5 Particularly, the present invention relates to an optical disk on which multimedia data, such as video data,' still 
image data, and audio data, may be recorded and to a recording unit and playback unit. 

[0002] A phase-change disk DVD-RAM (Digital Versatile Disc-RAM) with a capacity of several GB (Giga Bytes) has 
been introduced into the field of writable optical disks with a maximum capacity of about 650MB (Mega Bytes). As 
MPEG (MPEG2), the standard for coding digital AV (Audio and Video) data, is employed for practical use, DVD-RAM is 
now expected for use not only on computers but also as recording and playback media in the AV field. That is," it is 
10 predicted that DVD-RAMs will become media replacing magnetic tapes which have been used as standard AV recording 
media. 

[0003] (Description of DVD-RAM) Recently, as the recording density of a rewritable optical disk increases', not 
only computer data or audio data but also image data may be recorded on the optical disk. For example, on the signal- 
recording surface of an optical disk, the guide grooves in the form of projection and ditch have been provided 
15 conventionally. ■ 

[0004] In former days, signals were recorded only in the projection or the ditch positions. The introduction of 
the land-groove recording method has made it possible for signals be recorded in both the projection and the ditch 
positions. This method has achieved about twice as high density as before (For example, Japanese Patent Laid-Open 
Application No. JP-A-8-7282). 

2Q [0005] The CLV method (Constant Line Velocity) efficiently increases the recording density. A method such as the 
zone CLV method which makes the CLV method easier to control and implement was also devised and put into practical 
use (For example, Japanese Patent Laid-Open Application No. JP-A-7-93873). j 

[0006] One of major problems with an optical disk with an ever-increasing capacity is how to record AV data, 
including image data, and how to implement performance and new functions much higher than those of conventional AV 
equipment. 

[0007] With the advent of this large-capacity, rewritable optical disk, it is expected that tapes which have 
been used in most cases for AV data recording and playback will be replaced by optical disks. A shift in recording 
media from tapes to disks will have various influences on theTunction and performance of AV equipment. 

[0008] One of the most prominent advantages of the shift to disks is a great improvement in the random access 
performance. An attempt to make a random access to data on a tape involves rewinding one volume of tape which will 
30 usually take the order of minutes. This is much larger than the seek time (several ten milli-second , or less) of 
optical disk media. Thus, the tape cannot be used practically as a random access device. 

[0009] This random access performance of an optical disk makes possible the distributed recording of AV data 
which would be impossible on conventional tapes. 

[0010] FIG. 1 is a block diagram showing the drive of a DVD recorder. In the figure, reference numeral 11 is an 
35 optical pickup which reads data from the disk, 12 is an ECC (error correcting code) processor, 13 is a track buffer, 
14 is a switch switching input/output of the track buffer, 15 is an encoder, 16 is a decoder, and 17 is the enlarged 
view of a recording area on the disk. 

[0011] As shown in 17, the minimum unit of data recorded on the DVD-RAM disk is 1 sector=2KB. The ECC 
processor 12 performs error correction processing on 16 sectors = 1 ECC block. t 

40 [0012] The track buffer shown by 13 is a buffer used to record AV data at variable bit rates to efficiently 
record AV data on the DVD-RAM disk. This buffer acts as a buffer to resolve the difference between the DVD-RAM 
read/write rate (Va in the figure) which is constant the and the AV data bit rate (Vb in the figure) which varies 
according to the complexity of the contents (such as image data of video). 

[0013] More efficient use of this track buffer 13 allows AV data to be distributed on the disk. This is 
described below using FIGS. 2A and 2B. 

[0014] FIG. 2A is a diagram showing the address space of the disk. As shown in FIG. 2A, when AV data is 
recorded in separate contiguous areas [al, a2] and [a3, a4], supplying data, stored in the track buffer, to the 
decoder during the seek operation from a2 to a3 allows AV data to be played back continuously. FIG. 2B shows how 
data is accumulated into, and supplied from, the track buffer. 

[0015] AV data, which is read starting from al, is input into, and output to, the track buffer beginning at time 
50 tl. The amount of data corresponding to the difference in rate (Va - Vb) between the track buffer input rate (Va) 
and the track buffer output rate (Vb) is accumulated in the track buffer. This condition continues until data at a2 
is read (time t2). The amount of data B(t2), accumulated up to this time, is used as data that is supplied to the 
decoder until time t3 at which reading starts at a3 arrives. 

[0016] In other words, if the amount of data ([a1, a2]) accumulated before the seek operation is equal to or 
larger than a sufficient amount, AV data may be supplied continuously even if the seek operation happens. 

[0017] In the above example, data is read, or played back, from a DVD-RAM. The example also applies when data 
is written, or recorded, onto the DVD-RAM. 

[0018] As described above, if the data exceeding a sufficient amount is contiguously recorded on the. DVD-RAM, 
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continuous playback/recording is possible even if AV data is distributed on the disk. 



(Description of MPEG) j 

[0019] 1 Next, AV data is described. 

[0020] As described above, AV data recorded on a DVD-RAM uses the international standard called MPEG 
(ISO/IEC13818). 

[0021] A DVD-RAM, with a large capacity of several GB, is not large enough to store non-compressed digital AV 
data This means that AV data must be compressed before being recorded. One of the popular methods for 
compressing AV data is MPEG (ISO/I EC 138 18). A recent advance in the LSI technology makes it possible to implement 
an MPEG, codec (compression/decompression LSI chip), allowing the DVD recorder to MPEG-compress/ decompress 
data. 

[0022] 1 For highly efficient data compression, MPEG has the following two major characteristics: 
[0023] The first characteristic is that, in addition to the conventional compression method using the spatial 
frequency characteristics, MPEG uses a compression method using inter-frame time correlation characteristics for 
compressing video data. To compress data, MPEG classifies frames (also called pictures in MPEG) into three: I 
picture (intra-frame coded picture), P picture (picture using intra-frame coding and a reference to the preceding 
picture), and B picture (picture using intra-frame coding and a reference to the preceding and following pictures). 
[0024] FIG. 3 shows the relation among I, P, and B pictures. As shown in FIG. 3, the P picture refers to the 
immediately preceding I or P picture, while the B picture refers to the immediately preceding and following I or P 
picture. Also, because the B picture refers to the following I or P picture, the display order of pictures does not 
always match that (coding order) of compressed data as shown in FIG. 3. 

[0025] The second characteristic is that MPEG allocates an amount of coding dynamically to each picture 
depending upon the complexity of the image. The MPEG decoder has an input buffer and accumulates data in this 
decoder buffer, making it possible to allocate a large amount of code to a complex image which is difficult to compress. 
[0026] ' Audio data used on a DVD-RAM may be selected from the following three: MPEG audio data and Dolby 
digital data (AC-3) which are compressed and LPCM data which is not compressed. The bit rate of Dolby digital data 
and LPCM data is fixed. The size of MPEG audio data may be selected from several sizes in units of audio frames 
which are not so large as video streams. 

[0027] This AV data' is multiplexed into one stream using a method called a MPEG system. FIG, 4 is a diagram 
showing the configuration of the MPEG system. The reference numeral 41 is a pack header, 42 is a packet header, and 
43 is a payload. The MPEG system has a hierarchical structure consisting of packs and packets. A packet is composed 
of the packet header 42 and the payload 43. AV data, divided into several pieces each in an appropriate size, is 
stored in the payload 43 beginning at its head. The packet header 42 contains information on the AV data stored in 
the payload 43" it contains the ID (stream ID) identifying the stored data as well as the decoding time DTS 
(Decoding Time Stamp) with precision in 90 kHz and display time PTS (Presentation Time Stamp) of the data.included 
in the payload (For data such as audio data which is decoded and displayed almost at the same time, the DTS is 
omitted) A pack is a unit composed of a plurality of packets. Since one pack is used for one packet for DVD-RAM, a 
pack is composed of the pack header 41 and a packet (packet header 42 and payload 43). In the pack header is 
recorded the SCR (System Clock Reference) which is the 27MHz-precision time at which data in the pack is input into 
the decoder buffer. 

[0028] A MPEG system stream like this is recorded on the DVD-RAM, one pack per one sector (=2048 bytes). 
[0029] Next the decoder decoding the above-described MPEG system stream is described. FIG. 5 shows the 
decoder model (P-STD) of the MPEG system decoder. The refernce numeral 51 is an STC (System Time 
Clock) measuring the standard time used in the decoder, 52 is a de-multiplexer which decodes, or de-multiplexes, a 
system stream, 53 is an input buffer of the video decoder, 54 is a video decoder, 55 is a re-order buffer in which I 
and P pictures are stored temporarily to adjust the difference between the data order and the display order of I 
pictures and P pictures described above, 56 is a switch adjusting the output order of the I pictures and P pictures 
stored in the re-order buffer, 57 is an input buffer of the audio decoder, and 58 is an audio decoder. 
[0030] ' The system decoder having this configuration processes the above-described MPEG system stream as 
described below. When the time of the STC 51 matches the SCR described in the pack header, the de-multiplexer 52 
receives' the pack. The de-multiplexer 52 interprets the stream ID contained in the packet header and transfers the 
streams of data in the payload to the decoder buffer 53 or 57 for each stream. The de-multiplexer 52 also gets the 
PTS and DTS from the packet header. When the time of the STC 51 matches the DTS, the video decoder 54 gets 
picture data from the video buffer 53, decodes it, stores the I and P pictures in the re-order buffer 55, and 
displays the B pictures. When the picture the video decoder 54 decodes is an I or P picture, the switch 56 is 
switched to the output terminal of the re-order buffer 55 to output the preceding I or P picture from the re-order buffer 
55* when the picture the video decoder 54 decodes is a B picture, the switch 56 is switched to the output terminal 
of the video decoder 54. Like the video decoder 54, when the time of the STC 51 matches the PTS (there is no DTS for 
audio data), the audio decoder 58 gets one frame of audio data from the input buffer 57 and decodes it. 
[0031] Next the multiplexing method of an MPEG stream is described with reference to FIG. 6. FIG. 6(a) shows 
video frames, FIG. 6(b) shows the video buffer, FIG. 6(c) shows an MPEG system stream, and FIG. 6(d) shows audio 
data The horizontal axis, common to all figures, is the time axis. Data in each figure is drawn based on this time 
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axis. In the figure showing the video buffer status, the vertical axis indicates the buffer occupancy (amount of 
data accumulated in the video buffer) with the bold line indicating the chronological change in the buffer 
occupancy. The slope of the bold line corresponds to the bit rate, indicating that data is input into the buffer at 
a constant rate. A reduction in the buffer occupancy at a regular interval indicates that data is decoded at that 
time. The intersection of the dotted diagonal line and the time axis indicates the time at which the transfer of 
5 video frames to the video buffer is started. 

[0032] The following describes the operation with complex video data image A as an example. As shown in FIG. 
6(b), the data of image A must be transferred to the video buffer at time t1 that is earlier than the decode time (The 
time from the data input time t1 to the decode time is called vbv_delay) because image A requires a large amount of 
code. As a result, the AV data is multiplexed in the position of the video pack indicated by the shaded area in FIG. 

10 6(c). On the other hand, audio data, which does not require dynamic coding amount control as with video data, need 
not be transferred earlier than the decode time; in most cases, audio data is multiplexed some time earlier than the 
decode time. Therefore, for video data and audio data that are played back at the same time, the video data is 
multiplexed before the audio data. It should be noted that, for MPEG, all data except still-image data must be 
output from the buffer to the decoder within one second. This means that the maximum difference in the multiplexing 
time between video data and that of audio data is one second (Strictly speaking, the time needed for re-ordering 

15 video data may be added to the maximum time). 

[0033] In this example, although video data is multiplexed before audio data, audio data may be multiplexed 
1 before video data theoretically. When highly-compressed, easy-to-process video data is prepared and the audio data 
is transferred much earlier, it is possible to create such data. However, because of the limitation of MPEG 
described above, audio data may be transferred not earlier than one second. : 
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(Description of digital still camera) 



[0034] Next, a digital still camera is described. 

[0035] Recently, digital still cameras using JPEG (ISO/IEC 10918-1) have become popular. The popularity of 
digital still cameras is due to the fact that personal computers have rapidly come into wide use recently. Images 
25 taken by digital still cameras may be easily captured into personal computers via semiconductor memory, floppy disks, 
infrared light communication, and so forth. The still images captured into personal computers may be used in 
presentation software products, word processors, and internet contents. - ■ 

[0036] In addition, digital still cameras capable of capturing sounds have become used. The capability of 
recording sounds has given digital still cameras another advantage over conventional film cameras. 

30 [0037] FIG. 7 shows the relation between JPEG data recorded by a digital still camera and the directories and 
files on a PC (personal computer). 

[0038] As shown in FIG. 7, JPEG data is recorded in one file (with the extension code of "JPG"). When the number 
of files exceeds a predetermined number and it becomes difficult for the user to manage the files, they are usually 
organized into the directory structure, each directory including about 100 files as shown in FIG. 7. ; 

35 [0039] However, the number of still images that can be recorded by a digital still camera is limited by the 
recording capacity of the recording media such as flash memory or floppy disks. A large number of still images cannot 
be recorded. For example, when still images, each 50KB in size, are recorded! in the 100MB flash memory, the 
maximum number of still images that may be recorded at a time is as small as about 2,000 still pictures. 



(Description of digital VCR) 



[0040] Next, a digital VCR, in particular, a DVC which has rapidly become popular recently, is described. 

[0041] The introduction of the DVC has implemented new functions not provided on the conventional VCR. One of 
them is a recording in which video and still images are mixed. 

[0042] FIG. 8 is a diagram showing how the DVC records video and still images. 

[0043] As shown in FIG. 8, the DVC allows video and still images to be mixed in a sequential order on tape, 
allows video and still images to be alternately recorded, or allows still images to be recorded continuously just as 
they would on an album. 

[0044] However, the DVC, which is a tape medium, lacks random accessibility.. In addition, it has no management 
information similar to that used on the computer, making it difficult for the user to play back a particular still 
50 image the user wishes. 

[0045] The introduction of the DVD-RAM means a potential new AV equipment which solves the problem of limited 
number of still images of digital cameras and the problem of random accessibility of the DVC and which enables the 
user to process tens of thousands of still images freely. 

[0046] As described above, the DVD-RAM is expected as one of the next-generation AV recording media. The 
55 present invention solves the following problems which prevent the performance of the DVD-RAM from being maximized. 
The present invention also enables a DVD recorder to be implemented. The DVD recorder is thought of as the intended 
and most important application of the rewritable large-capacity optical disk DVD-RAM. ! 

[0047] The most serious problem of processing a large amount of still image data on the DVD recorder is that the 

i . ! it 
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amount of management information is extremely large. 

[0048] 1 The still image Jdata management information is described with reference to FIG. 9. 

[0049] . Access to still image data recorded on the disk requires information such as the address and the size of 
data the user is going to access. 

[0050] In addition, the addition of sound data as on a digital still camera requires information not only on the 
address and the size but also on the playback time of the sound data. Post-recording, which is recorded separately 
after still image data is recorded, also requires post-recording audio data management information. 
[0051] Access to the 4.7 GB data area, one sector at a time (1 sector = 2048 bytes), requires 4 bytes for the 
address, 1 byte for still image data, and 2 bytes for sound data; in addition, for sound data, another 2 bytes is 
required for the playback time. The post-recording of sounds requires twice as large management information, with 
the total management information area being 21 bytes in size. 

[0052] If 65000 still images are recorded and 21 bytes of management information is used for each still image, 
the size of the management information is calculated as: 

65000 x 21 bytes = 1365000 bytes 
The total of about 1.4 MB of management information is required. 

[0053] Although 1.4 MB of data is small as compared with the DVD recording capacity, the system controller 
(corresponds to the CPU of a PC) must always have this data in memory for use in random access. Despite a 
significant drop in the price of memory, it is unusual for AV equipment to have memory larger than one MB. And, it 
is impractical for AV equipment to have a battery backup for backing up the memory, larger than one MB, against an 
emergency. 

[0054] ! In a first aspect, the present invention provides a recording medium which minimizes the storage area for 
data management information to allow the recording area to be used efficiently, a recording unit which records data 
on the recording medium, and a playback unit which plays back data from the recording medium. In a second aspect, a 
recording medium according to the present invention comprises a still data image area (102) in which a plurality of 
still image data (VOB) pieces may be recorded and an area (102) in which still image set management information 
(VOBSl), managing the still image data (VOB) in a part and the whole of the still image data area as a gathering 
still image set (VOBS), is recorded. The still image set (VOBS) has corresponding still image set management 
information (VOBSl). 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0055] Fig. 1 is a block diagram of a DVD recorder drive unit. 

[0056] | Fig. 2A is a diagram showing the address space on a disk. 

[0057] * Fig. 2B is a diagram showing the accumulation amount of data in the track buffer. 

[0.058] i Fig. 3 is a diagram showing the relation of pictures in an MPEG video data stream. 

[0059] ' Fig. 4 is a diagram showing the configuration of the MPEG system stream. 

[0060] , Fig. 5 is a block diagram of an MPEG system decoder (P-STD). 

[0061] Fig. 6(a) is a diagram showing video data, FIG. 6(b) is a diagram showing a video buffer, FIG. 6(c) is a 

diagram showing an MPEG system stream, and FIG. 6(d) is a diagram showing audio data. 

[0062] . Fig. 7 is a diagram showing the still image management method in a digital still camera. 
[0063] Fig. 8 is a diagram showing the recording status of video and still images of a digital VTR. 
[0064] Fig. 9 is a diagram showing the configuration of still image management information. 

[0065] Fig. 10(a) is a diagram showing the directory structure, and FIG. 10(b) is a diagram showing the physical 
arrangement on a disk. . 

[0066] Fig. 11A is a diagram showing management information data. 

[0067] * Fig. .1 1B is a diagram showing stream data. 

[0068] Fig. 12 is a diagram showing the configuration of still image set management information 

[0069] 1 Fig. 13 is a diagram showing a link relation between still images and audio data. 

[0070] , Fig. 14 is a flowchart showing how to determine a still image address and how to check whether or not 
audio data is present. , 

[0071] Fig. 15 is a block diagram of a DVD recorder/player. 

[0072] I Fig. 16 is an diagram showing an example of a still image enable flag. 

DETAILED DESCRIPTION OF THE EMBODIMENTS . 

[0073] The present invention will be described more in detail using a DVD recorder and a DVD-RAM which are one 
embodiment of the present invention. In the description of the embodiment, the term "player" sometimes includes the 
function of a player as well. 



10 



EP 0 965 991 A1 

[0074] (Logical configuration of a DVD-RAM) First, the logical configuration of a DVD-RAM will be described with 
reference to FIG. 10. FIG. 10(a) shows the configuration of data on the disk viewed from the file system, and FIG. 
10(b) shows the physical sector address on the disk. The physical sector address begins with the lead-in area 100 
where reference signals or other media identification signals necessary to stabilize the servo are recorded. The 
lead-in area 100 is followed by the data areas 101 and 102. In this data area is written logically effective data 
such as video data, still image data, and audio data. The logical sector address is ended by the lead-out area 103 
where reference signals and so on are recorded as in the lead-in area. 

[0075] The data area begins with volume information area 101 which is management information for use by the file 
system. 

[0076] Data on the disk may be treated as directories or files via the file system as shown in FIG. 10(a). 

[0077] All data processed by the DVD recorder is placed under the V!DEO_RT directory which is immediately under 
the ROOT directory. 

[0078] The files processed by the DVD recorder are classified roughly into two: one management S file 
and a plurality of (at least one) AV files. ' 1 

15 (Management file) • i ! 

i 

[0079] Next, referring to FIG. HA, the contents of the management information file are described with emphasis 
on the management information on video. 

[0080] The management information file is classified roughly into the VOBI (VOB information) table and the PGCI 
20 (PGC information) table. A VOB is an MPEG program stream, while a PGC defines the playback order of cells in a 
logical playback unit in a sub-range (or whole range) of a VOB. In other words, the VOB is meaningful for MPEG, 
while the PGC is a unit the player plays back. f j 

[0081] The VOBI table consists of the number of VOBIs (Number_of_VOBIs) and a plurality of VOBIs; Each VOBI 
consists of the corresponding AV file name (AV File Name), VOB identifier on the disk (VOBJD), start address in the 
AV file (VOB_Start_Address), end address in the AV file (VOB_End_Address), VOB playback time length 
25 (VOB_Playback Time), and stream attribute information (VOB_Attribute), ; 

[0082] The PGCI table consists of the number of PGCIs (Number_of_PCGJs) and a plurality of PGCIs. Each PGI 
consists of the number of celll (Cell information) entries^and the cellls. Each cell! consists of the playback start 
time in the VOB (Ce!l_Start_Time), playback time in the VOB'(CelLPIayback_Time), playback start address in the VOB 
(Cell_Start_Address), and playback end address in the VOB (Cell_End_Address). (AV file) 
30 [0083] Next, an AV file is described by referring to FIG. 11 B. 

[0084] An AV file consists of a plurality of VOBs, which are recorded in the Av file consecutively. It should be 
noted that the AV file sometimes consists of only one VOB. The VOBs in the AV file are managed by the VOB 
information in the above-described management file. The player first accesses the management information file, reads 
the VOB start address and end address, and then accesses the VOB. Within the VOB are defined cells which are logical 
playback units. A cell is a partial playback range (or whole range). This cell allows the user to perform simple 
editing without having to operate on actual AV data. As with access information on a VOB, access information on a 
cell is maintained in the management information file. The player first accesses the management information file, 
reads the cell start address and the end address, and then accesses the cell. 

[0085] Cell address information is relative to the start of the VOB, and VOB address is relative to' the start of 
the AV file; therefore, the VOB address is added to the cell address to calculate the address in the AV file before 
40 the player accesses the AV file. 

(Still image data management information) 

[0086] Next, by referring to FIG. 12, still image data management information is described. 
45 [0087] For still image management information, VOBSIs (VOBS information), instead' of VOBIs, are stored in the 
VOBI table. Each VOBS is a set of a plurality of VOBs, each consisting of a still image and audio data synchronizing 
the still image if any. , 

[0088] A VOBSI consists of the corresponding AV file name (AV_FiIe_Name), VOBS identifier for identifying a 
particular VOBS among a plurality of VOBSs on the disk (VOBSJD), start address in the AV file (VOBS_Start_Address), 
end address in the AV file (VOBS_End_Address) ( still image management table (Video Table) containing management 
50 information on the still image data in the VOBS, and audio management information table (Audio_Tab!e) containing 
management information on the audio data in the VOBS. 

[0089] The still image management information table (Video_Tab!e) consists of at least one entry of still image 
management information (Videol), one for each still image, and information on the number of still image management 
information entries (Number_of_VideoIs). The still image management information (Videol) consists of one byte of 
55 still image data size information (Size) and one byte of pointer information (Ptr_to_Audiol) pointing to the audio 
management information in the audio management table (Audio_Table) for the audio information to be played back with 
the still image. i j , . \ ' 

[0090] The audio management information table (Audio_Table) contains audio management information (Audiol) on 
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each piece of audio data and the number of audio management information entries (Number_of_Audiols). The audio 
management information (Audiol) contains 4 bytes of audio data address information (Address), 2 bytes of audio data 
size information (Size), 2 bytes of audio playback time information (PlaybackJTime), and 1 byte of pointer 
information (Ptr_to_Audiol) pointing to the audio information (Audiol) within the audio management information table 
(Audio_Table) where post-recording audio data is stored for use as post-recording audio information when post- 
recording is used. 

[0091] The PGCI table which defines the playback sequence contains information different from that of video on a 
cell! level. A still image set celll consists of the identifier (VOBSJD) of the corresponding VOBS, start VOB 
number in the VOBS (CelLStart_Video), and end VOB number in the VOBS (CelLEnd_Video). 

[0092] This configuration allows the cells of a still image set in a range (from any stiil image to any still 
image) of the VOBS to be played back. 

[0093] Next, referring to FIG. 13, the link between still images and audio data is described. 

[0094] The still image management information (Videol) has pointer information (Ptr_to_Audiol) pointing to the 
audio management information (Audiol) in the audio table (Audio_Table). An insignificant value (=0) in this field 
indicates that the still image has no synchronizing audio data to be played back (Video#3 and Video#4). Conversely, 
a significant value, if included in the pointer information (Ptr__to_Audiol), indicates that the still image has 
synchronizing audio data to be played back (Video#1 and Video#2). 

[0095] When post-recording data is added and new audio data is recorded, pointer information (Ptr_to_Audiol) 
pointing to some other audio management information (Audiol) is created in the audio management information (Audiol) 
. As with the pointer information (Ptr_to_Audiol) in the still image management information (Videol) described above, 
a significant value in the pointer information (Ptr_to_Audiol) in the audio management information (Audiol) 
indicates that there is post-recording audio data (Audio#1 -> Audio#3). 

[0096] 1 Next, the relation between still image management information (Videol)/audio management information 
(Audiol) and actual data (AV data) in an AV file is described. 

[0097] The order of still image management information (Videol) in the still image management information table 
(Video_Table) matches the order in which still image data was recorded in the AV file. Also, the order of audio 
management information (Audiol) in the audio management information table (Audio Table) matches the order in which 
audio data was recorded in the AV file. 

[0098] Therefore, for a VOBS consisting only of still image data with no audio data, the address of a still 
image may be calculated simply by adding the still image data sizes (Size) recorded in the still image management 
information (Videol) beginning at the start of the VOBS. 

[0099] When audio data is enclosed by still images (audio 1 and audio 2), the address generated by adding the 
still image data sizes is compared with the address in the audio management information (Audiol). If they match, it 
is found that the audio data is recorded at this address and therefore the data size of the audio data is added to 
the address. By repeating this calculation, all still image data in the VOBS may be accessed. 

[0100] Next, referring to the flowchart in FIG. 14, access to still images and audio data, recorded on the 
optical disk used in the embodiment of the present invention, is described more in detail. 

[0101] First, Add indicating the current address, the variable i indicating an entry number in the still image 
management information table (Video Table), and the variable j indicating an entry number in the audio management 
information table (AudioJTable) are initialized. 

Add = VOB_Start_Address 

i = 1 

j=1 

(stepl). i- 

! !- 1 

[0102] The variable j and the number of audio management information entries (Number_of_Audiols) are compared 
and if 

j <= Number_of_Audiols 

is satisfied, control is passed to step 3 where the audio data and the address are compared. Otherwise, control is 
passed to step 5. 

(step 2) 1 

[0103] The current 'address Add and the address information in audio management information #j are compared 
and if 

Add == Audio[j].Address , 

is satisfied, the current address Add is the start address of the audio data managed by audio management information 
#j (Audio #j) and therefore control is passed to step 4 where the current address is added. If the above expression 
is not satisfied, control is passed to step 5. 



(step 3) 



20 
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[0104] The audio data size in audio management information #j (Audio! #j) is added to the current address, Add, 
the variable j is incremented, and control is passed back to step 2. , i 

Add += Audio[j].Size ( 
j++ , I 

(step 4) 

[0105] If the conditional expression in step 2 or step 4 is not satisfied, it means that the current address Add is 
a still image data address and therefore the still image address is determined. 



10 (step 5) 



[0106] Next, a check is made to see if there is a pointer to audio management information (Audiol). If there is 
a pointer, control is passed to step 7 to search for audio data synchronizing with the still image to be played back 
with the still image. If there is not such a pointer, control is passed to step 10 to play back the still image. i , i 



15 (step 6) 



[0107] The audio data to be played back in synchronization with the still image is assigned temporarily to 
Ptr_to_Audiol. 

(step 7) j 

■ 1 ; 

[0108] A search is made to see if the audio management information (Audiol) pointed to PTR_to_Audiol points to 
another audio management information (Audiol) entry. If there is a link to another audio management information 
(Audiol) entry, control is passed back to step 7. 



25 (step 8) 



30 



[0109] When it is found in step 8 that there is no more link to another audio management information (Audiol) 
entry, the audio data to be played back in synchronization with the still image is determined. 1 

■ i . i ! 

(step 9) : I 

[ i > i 

[0110] The still image data determined in step 5 and the audio data determined in step 9, if found, are played back. 

' ■ » ! 

(step 10) ; j i i 



35 [01 1 1] The variable t is incremented. 
j++ 

(step 11) 
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[01 1 2] The variable i is compared with the number of still image management information entries 
(Number_of_Videols) and if 
i <= Number_of_Videols 

is satisfied, it indicates that there is still another piece of still image data to be played back in the still 
image set (VOBS). Control is passed back to step 2. If the above expression is not satisfied, the playback of the 
VOBSends. . - i. 

(step 12) } ' 

(VOBSI data size) 

[0113] Next, the management information size for the still image set used in the embodiment is described. 

[0114] As shown in FIG. 12, management information on one still image requires 2 bytes, one byte for the still 
image size and one byte for the pointer to audio data. Thus, even if 65,000 still images are taken, the size is 

65,000 x 2 bytes = 130,000 bytes 
That is, the size is about 130KB. As compared with 1.4 MB described in the prior art, this size is as small as 10% of 1.4 
MB. 

[0115] When audio data is recorded at the same time, adding audio data to all 65,000 still images is unrealistic 
in terms of capacity even on the DVD-RAM which is a large-capacity recording medium. 

[01 1 6] If the size of one still image is 50KB, then : 



i 



. If each audio data piece is 
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4.7GB - 65,000 x 50KB = 1.45GB 



192kbps and 10 seconds, then 

i i 1. 45GB/ 192kbps x 10sec = 6,041 

This means that as many as about 6,000 audio data pieces may be recorded. Because each audio data management 
information entry requires 9 bytes, the total is calculated as: 

6,000 x 9 bytes = 54,000 bytes 
The total is 184KB, which is about 13% of the conventional management information. 

[0117] As a modification of the management method described in FIG. 12 to FIG. 14, 4 bytes of the still image 
data address information (Address) may be added to the still image management information (Videol) for each still 
image, composed of one byte of size information (Size) and one byte of pointer information (Ptr_to_Audio1) pointing 
to the audio management information, shown in FIG. 12. This means that, though the data size of the management 
information on one still image is increased to 6 bytes as compared with that of the above method, access to the 
still image data becomes easier. At this time, when there is no audio data to be played back in synchronization with 
the still image, the management information may be reduced to about 29% (6/21) of the management information data 
size (21 bytes for each still image) of the prior art shown in FIG. 9. (Block diagram of the DVD recorder) 
[0118] FIG. 15 is a block diagram of the DVD recorder used in the embodiment of the present invention. 
[0119] In the figure, the reference numeral 1501 is a user interface unit displaying information to, and 
receiving a request from, the user, 1502 is a system controller performing overall management and control, 1503 is 
an input unit consisting of a camera and a microphone, 1504 is an encoder consisting of a video encoder, an audio 
encoder, and a system encoder, 1505 is an output unit consisting of a monitor and a speaker, 1506 is a decoder 
consisting of a system decoder, an audio decoder, and a video decoder, 1507 is a track buffer, and 1508 is a drive. 
The system controller 1502 is a microcomputer or some other digital signal processor. The system controller 1502 
controls access to the optical disk as directed by the program whose flowchart is shown in FIG. 14. 
[0120] First, the recording operation of the DVD recorder is described by referring to FIG. 15. 
[0121] , First, the user interface unit 1501 receives a request from the user. The user interface unit 1501 sends 
the user request to the system controller 1502, and the system controller 1502 interprets the user request and makes a 
processing request to each module. When the user request is to take and record a still image, the system controller 
1502 requests the encoder 1504 to encode one video frame and audio data. 

[0122] ; The encoder 1504 video-encodes and system-encodes one video frame sent from the input unit 1503 and 
. sends the result to the track buffer 1507. 
[0123] Next, the encoder 1504 tells the system controller 1502 that the still image data has been created. The 
system controller 1502 records the still image data stored in the track buffer 1507 onto the DVD-RAM disk via the 
drive 1508. | 

[0124] * After encoding the video data, the encoder .1504 immediately starts audio-encodes the audio data sent from 
the input unit 1503 and sequentially transfers the generated audio data to the track buffer 1507. 

[0125] The encoder 1504 tells the system controller 1502 that audio encoding has been started. The system 
controller 1502 sequentially records the audio data stored in the track buffer 1507 onto the DVD-RAM disk via the 
drive 1508. 

[0126] A stop request from the user is sent to the system controller 1502 via the user interface unit 1501. The 
■ system controller 1502 sends the recording stop instruction to the encoder 1504. The encoder 1504 ends encoding 
after the -immediately-following audio frame is encoded, transfers all audio data to the track buffer 1507, and tells 
the system controller 1502 that encoding has finished. The system controller 1502 records all remaining audio data to 
the DVD-RAM disk via the drive 1508. 

[0127] • After finishing the above operation, the system controller 1502 creates the above -described VOBSIs and 
cellls and records them on the DVD-RAM disk via the drive 1508. At this time, it is important that link information 
(Ptr_to_Audiol) pointing to the audio management information (Audiol) in the still image management information 
(Videol) is generated so that it points to the audio management information (Audiol) of audio data recorded at the 
same time. 

[0128] When the user continuously records still images and audio data as described above, one VOBS is created. 

The VOBS is a unit in the data structure and, at the same time, a block of still images taken continuously by the 

user at the same time. A plurality of VOBSs can be created within one recording medium. 

[0129] Next, the playback operation of the DVD recorder is described with reference to FIG. 15. 

[0130] First, the user interface unit 1501 receives a request from the user. The user interface unit 1501 sends 

the user request to the system controller 1502, and the system controller 1502 interprets the user request and makes 

a processing request to each module. When the user request is to play back a PGC pointing to a still image set (VOBS) 

, the system controller 1502 reads the PGC information (PGCI) via the drive 1508 and, from VOBSJD described in the 

cell information (Celil) of the PGCI that was read, reads the VOBS information (VOBSI). 

[0131] Next, according to the flowchart described in FIG. 14, the system controller 1502 checks the address of 
the still image data to be played back, checks if there is audio data to be played back in synchronization with the 
still image data, and finds the audio data. 

[0132] ' Next, the system controller 1502 asks the drive 1508 to read the still image data first and then audio 
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data (if any) from the DVD-RAM disk and to store them into the drive 1 508. 

[0133] Then, the system controller 1502 issues a decode request to the decoder 1506. The decoder 1506 reads AV 
data from the track buffer 1507 and decodes it. The decoded data is displayed on the monitor, or output from the 
speaker, via the output unit 1505. , 

5 [0134] In this embodiment, an example of DVD-RAM is described. The present invention is not limited only to a 
DVD-RAM or an optical disk but applies to other media too. Other media include random access recording media such 
as a magneto-optical disk, magnetic disk, and semiconductor memory. 

[0135] In the embodiment, still image data VOBs and audio data VOBs are recordedjin an AV file separately from 
other VOBs. They may also be recorded in the AV file in which other VOBs are recorded. The present invention is not 
^ limited by the configuration of an AV file. j 

[0136] In the embodiment, the order of audio management information (Audiol) entries in the audio management 
information table (AudioJTable) matches the order in which data is recorded in the AV file. In essence, the order is 
not limited. However, when the order of audio management information (Audiol) entries do not match the order in 
which data is recorded in the AV file, the search for the audio management information (Audiol) is not narrowed down 
to one entry and therefore all audio management information (Audiol) must be searched. , 
*5 [0137] In this embodiment, all still images and all audio data managed by a VOBSI are recorded in an area in the 
AV file beginning at VOBS_Start_Address and ending at VOBS_End_Address. However, audio data, especially audio 
data recorded through post-recording, need not be recorded in this range but may be recorded in any position within 
the AV file as long as it is not included in a recording area (from VOBS_Start_Address to VOBS_End_Address) 
managed by some other VOBS. 

20 [0138] In addition, a one-bit playback identification flag (Playback Permission), if provided in still image 
management information (Videol) to indicate the playback enable/disable option as shown in FIG. 16, enables the user 
to specify a still image not to be played back, that is, a still image to be skipped. .This ability allows the user 
to play back only the selected still images from a large number of still images that were taken. t 
[0139] In FIG. 12, an address is represented by 4 bytes. The address may be represented by 3 bytes because the 
maximum number of sectors (2048 bytes) on a 4.7GB disk is 2,464,153 (=4.7x1024x1024x1024/2048) and therefore all 

25 sector addresses on the disk may be represented by at least 22 bits. 

[0140] The embodiment of the present invention is^ajx.optical disk on which at least still image data is recorded, 
the optical disk comprising still image set management information (VOBSI) managing synthetically a plurality of 
still image data pieces as one still image set and a variable-size still image management information table 
(Video_Table) proportional to a number of still images managed by the still image set management information (VOBSI) 
30 . When audio data to be played back in synchronization with the still images is recorded, the optical disk further 
comprises, a variable-size audio management information table (Audio_Table) proportional, in size, to the number of 
the audio data pieces to be played back in synchronization with the still images in the still image set. The still 
image management information table (Video_Table) comprises at least one still image management information (Videol) 
entry composed of a still image data size and pointer information (Ptr_to_Audiol) pointing to the audio 
management information (Audiol) to be played back in synchronization with the still image. 

35 [0141] As a result, the present invention compresses the management information on the still images and audio 
data, reducing them to a little larger than 10% of that used in the conventional configuration. 

[0142] The audio management information table (Audio_Table) comprises at least one audio management 
information (Audiol) entry composed of an audio data address, an audio data size, an audio playback time, and, when 
post-recording is used, pointer information (Ptr_to_Audio1) having a link to other audio management information 
40 (Audiol). Therefore, the present invention allows the user to perform post-recording without losing the original 
audio management information. . ; 

[0143] For each still image in the still image set, a playback identification flag (Playback_Permission), which 
indicates whether or not the still image is to be displayed during playback, is provided in the still image 
management information (Videol). Therefore, the present invention allows the user to specify that unnecessary still 
images be skipped during playback. , 
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Claims 

1. A recording medium comprising: 



a still image data area (102) capable of storing a plurality of still image data (VOB) pieces therein; and 

an area (102) storing still image set management information (VOBSI) therein 'for managing synthetically a 
part or whole of still image data (VOB) in said still image data area as one still image set (VOBS), ' 

said still image set (VOBS) having said corresponding still image set management information (VOBSI). ' 

A recording medium according to claim 1 further comprising: j i 

an area (102) capable of recording audio data therein, wherein said still image |set management information 
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(VOBSI) comprises still image management information (Video!) for each still image, wherein said still image 
management information (Videol) comprises a data size of said still image and information (PtrJo_Audiol) 
indicating whether or not the audio data to be played back in synchronization with said still image is recorded. 



A recording medium according to claim 1 further comprising: 

t 

an area (102) recording therein audio data to be played back in synchronization with said still image, wherein 
said still image set management information (VOBSI) comprises audio management information (Audiol) 
. comprising a data size of said audio data and a playback time of the audio data. 



A recording medium according to claim 1 further comprising: 

an post-recording area (102) capable of recording therein audio data after said still image is recorded, 
wherein said still image set management information (VOBSI) comprises pointer information (PtrJo_Audiol) 
specifying the area of audio management information managing the audio data recorded in said post-recording 
area. 



A recording medium according to claim 2, wherein said still image management information (Videol) comprises, for 
each still image, a playback identification flag (Playback_Permission) specifying whether or not the still image 
will be displayed during playback. 

A recording medium according to any of claims 1 to 5, wherein said recording medium is an optical disk to and 
from which data may be written and read. 

. A recording unit recording data on the recording medium according to claim 1 , said recording unit comprising: 

recording means (1508) for recording the still image data in said still image data area (102) on said 
recording medium; and 

means (1502) for creating still image management information (Videol) when said still image data is recorded, 
said still image management information (Videol). managing said still image data, wherein said still image 
management information (Videol) is stored in the area (102) by said recording means (1508), said area 
containing said still image set management information (VOBSI) on said recording medium. 



. A recording unit according to claim 7, further comprising: 

recording means, (1508) for recording audio data on said recording medium; and 

means (1502) for creating audio management information (Audiol) when the audio data to be played back in 
synchronization with said still image data is recorded, said audio management information managing said 
audio data, wherein said audio management information (Audiol) is written into the area by said recording 
means (1508), said area containing said still image set management information (VOBSI). 



9. A recording unit according to claim 7, further comprising: 

recording means (1508) for recording audio data on said recording medium; 

means (1502) for creating audio management information (Audiol) when the audio data is recorded after said 
still image is recorded, said audio management information managing said added audio data; and 

means for creating pointer information (Ptr_to_Audiol) specifying the area containing the audio management 
information (Audiol) managing the added audio data, wherein said pointer information (Ptr_to_Audiol) is 
written into the area by said recording means (1508), said area containing said still image set management 
information (VOBSI). 



10. A recording unit according to claim 7, further comprising means (1502) for setting, for each still image, a 
playback identification flag (Playback_Permission) into said still image management information (Videol), said 
playback identification flag (Playback_Permission) specifying whether or not said still image will be displayed 
during playback. 

11. A recording unit according to any of claims 7 to 10, wherein said recording medium is an optical disk and said 
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recording unit is an optical disk recorder capable of writing the data onto said optica! disk. * i 

; t 

12. A playback unit playing back data recorded on the recording medium according to claim 1, said playback unit 
comprising: 

means (1502) for identifying the area where the still image to be played back is recorded based on 
information recorded in the still image management information (Video!) in said still image set management 
information (VOBSI); and , 

means (1508) for accessing the area where said still image data is recorded on said recording medium and 
for reading the data therefrom. 

13. A playback unit playing back data recorded on the recording medium according to claim 3, said playback unit 
comprising: ■ t 

means (1502) for identifying the area where the still image to be played back is recorded based on 
information recorded in the still image management information (Videol) in said still image set management 
information (VOBSI); 

means (1508) for accessing the area where said still image data is recorded on said recording medium and for 
reading the data therefrom; 

means (1502), in a case that an information (Ptr_to_Audio) indicating that recording of an audio data to be 
played back in synchronization with said still image when said still image is played back, for .determining 
an area for accessing the audio data to be reproduced on the basis of said audio management information of 
the audio data; and 

25 means (1505) for playing back said still image and said audio data. x 

14. A playback unit playing back data recorded on the recording medium according to claim 5, comprising: 

means (1502) for identifying the area where the still image to be played back is recorded based on 
30 information recorded in the still image management information (Video!) in said still image set management 

information (VOBSI); . I 

means for accessing the area where said still image data is recorded on said recording medium and for 
reading the data therefrom; and t 

35 means (1502) for preventing said still image from being played back when said playback identification flag 

(Playback_Permission) included in said still image management information (Videol) indicates a playback 
disable status. . 
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f ■ i ! ; 

15. A playback unit according to any one of claims 12 to 14, wherein said recording medium is an optical disk and 
said playback unit is an optical disk player capable of reading data from said optica! disk. ! 

I ■: i • . i : 
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